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In this article we derive upper bounds on the decoder error probability for Reed- 
Solomon codes . By definition, “decoder error” occurs when the decoder finds a code- 
word other than the transmitted codeword , this is in contrast to “decoder failure,” which 
occurs when the decoder fails to find any codeword at all Our results imply, for example, 
that for a t error-correcting Reed-Solomon code of length q - 1 over GF( q), if more than 
t errors occur, the probability of decoder error is less than 1/tl In particular, for the 
Voyager Reed-Solomon code, the probability of decoder error given a word error is 
smaller than 3 X 10~ 14 . Thus , in a typical operating region with probabiliy 10~ 5 of word 
error, the probability of undetected word error is about 10~ 19 . 


I. Introduction 

Deep Space missions, including Voyager, use error-correct- 
ing codes to allow very low probability of error in messages 
from the spacecraft to earth, even though signal-to-noise 
ratios are very low One of the digital coding schemes used by 
Voyager is shown in Fig. 1 . This scheme allows a bit -error rate 
as low as 10“ 6 with a bit signal-to-noise ratio EjN 0 as low as 
2 3 dB. 

Besides low decoder bit -error rate, we would like a system 
with very low probability of undetected error. That means 
that, in case there are bit errors in a certain region, we would 
like to know that. (Of course, knowing exactly where the bit 
errors are is equivalent to having no bit errors, which is impos- 
sible. But we would like to know that a certain string of bits 


contains errors or, preferably, that it is very unlikely to con- 
tain errors.) 

Our Reed-Solomon code is a (255,223) 8-bit code. This 
means that bits are arranged in symbols of 8 bits each, which 
are in turn arranged in words of 255 symbols, of which 223 
symbols are information and the other 32 are parity. Whenever 
our decoders detect 16 or fewer symbol errors, they correct 
these errors. But if the string going into the decoder differs 
from every codeword in at least 17 symbols, we are able to 
detect this, and we know that many symbols (and therefore 
many bits) in the word are in error. We would like to know the 
probability that, in the fairly rare instance that 17 or more 
symbol errors are made, the decoder makes further correction 
and therefore incorrectly reports successful decoding. This 
article shows that this probability is less than 3 X 10” 14 . 
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Let C be an (n, A:) code over GF(q ), with minimum dis- 
tance d. We assume C is being used to correct t errors, where t 
is a fixed integer satisfying 2 1 < d - 1 . We further assume the 
decoder is a bounded distance decoder, i.e., it looks for a 
codeword within distance t of the received word; if there is 
such a codeword, the decoder finds it, and if not, the decoder 
reports “failure.” 

If the transmitted codeword suffers t or fewer errors, it will 
be decoded correctly. If, on the other hand, it suffers more 
than t errors, one of two things can happen. Either the decoder 
will fail to find a codeword ( decoder failure ), or it will find a 
codeword other than the transmitted codeword ( decoder 
error). We denote by P F and P E the probabilities of decoder 
failure and error, respectively. Of course if the number of 
errors is t or less, P F -P E = 0* ^ the number of errors exceeds 
/, but is less than d - T, then P F = 1 and P E = 0, since fewer 
than d - t errors occunng cannot move the transmitted code- 
word to within distance t of another codeword. 

If d - t or more errors occur, it is in general quite difficult 
to calculate, or even estimate, P F and P E , although if the code 
is being used in a practical communications system, it is impor- 
tant to do so. A useful heuristic estimate can be based on the 
assumption that if at least d - t errors occur, the error pattern 
can be treated as if it were completely random. The probabil- 
ity that a completely random error pattern will cause decoder 
error (i.e., lie within distance t of a nonzero codeword) is given 
by 

(q k - 1) • V (0 

Q = — = (Q- r -Q~ n )V n (t) (1) 

<7” 

where r - n - k is the code’s redundancy and 

r.O’Z (") <■’-!>■ «> 

5=0 

is the volume of a Hamming sphere of radius t This argument 
leads to the following estimate for P E * 

P„ ^ Q • ?r {> d - t errors) (3) 

h. 

It is difficult to justify this estimate in general, but in this 
article we will see that if we increase Q slightly by defining 
Q' as 

Q' = (q- 1)-'K„(0 (4) 


then for Reed-Solomon codes, 

P £ < Q* • Pr{>d - Terrors) (5) 

In fact Eq (5) will follow from more detailed results, which 
we now describe. 

If q u denotes the probability that the error pattern has 
weight u, then plainly 


P E = Yj P E^% ( 6a ) 

u~0 


p F - £ ( 6b ) 
u=0 

where P E (u) and P F (u) denote the conditional probabilities of 
decoder error and failure, respectively, given u channel errors 
As mentioned above, we have P E (u) =P F (u) = 0 for u < t and 
P E (u) ~ ^f( w ) = lfor t <u <d -t For u>d -t we have 

P F (u) + P E {u) = 1, and so if P E {u) is known, P F {u) can be 
calculated, and vice versa. 

Here is our main result. Let C be an (n, k) Reed-Solomon, 
or any other maximum distance separable (MDS) code, with 
minimum distance d = n - k + 1 We assume as above that the 
code is being used to correct t errors, for some fixed value of t 
with 2t<d- \ We further assume that the code is being used 
on a channel for which all error patterns of the same weight 
are equiprobable, for example, a #-ary symmetric channel. 
Under these assumptions, we shall prove in Section III that 

P E (u) = 0 for u < d - t - 1 (7a) 

t 

(“)<(? - l) - '' (”) - 1 foid-t<u<d- 1 

s-d-u 

(7b) 

P e (u)<Q’ for u>d (7c) 

Of course Eq. (7a) needs no further proof; it is included only 
to make the bounds in Eq. (7) apply to all values of u. The 
bound (7b) actually follows from a slightly sharper, but more 
complicated bound on P E {u) that appears in Section III as 
Eq. (15). 
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We can combine Eqs (7b) and (7c), at the cost of weaken- (It would be nice to have a uniform lower bound on the 

ing Eq. (7b) slightly, to obtain an upper bound on P E (u) P E (u)% but we have been unable to find one.) 

which is uniform in u for u > d - t : 


P E (u) <Q’ for u>d-t (8) 

The ratio of this uniform bound Q ' to the heuristic estimate 
Q in Eq. (1) is usually very close to 1 , and is always less than 
(q!(q - 1))”, which for n < q - 1 cannot exceed e = 2.718 
In any event, combining Eq. (6a) with (7a) and (8), we obtain 
the bound (5). 


Although as a practical matter it is not hard to compute 
the bound Q ' numerically, for some applications it may be 
worthwhile to have a simpler, though weaker, bound In the 
Appendix, we show that Eq. (8) implies that provided n < 
q - 1 , for all u > d - t, 


J — + — L 


(q-iy - 2 (q-iy 


P E (U)< 


1 _ 


-2 1 t\ 


if r = 1 


if f > 2 


( 9 ) 


Since r > 2t in all cases, Eq (9) implies, whenever n < q - 1 , 


P £ ( u ) < — for all u > t + 1 


( 10 ) 


Kasami and Lin (Ref. 2) have also studied the problem of 
decoder error for Reed-Solomon codes. They showed that on a 
q - ary symmetric channel/^ is at most Q , i.e., that 

E ^(«)(")e"(l-e)"- u <Q (11) 

u=d-t ' ' 

where e is the probability of channel symbol error. They fur- 
ther showed that P E = Q only when e = (q - 1 )/<?, i.e., when 
the error pattern is completely random. This shows that Q is 
the tightest possible bound on the sum in Eq. (11) which is 
independent of e. However, except when the probabilitity of 
> d - t errors is very nearly 1 , our bound (5) will be smaller 
than Kasami and Lin’s bound (11). And since most well- 
designed systems will have Pr{w > d - t } « 1 , we conclude 
that our bound is likely to be more useful in practice than 
Kasami and Lin’s. 

Finally we note that since with e = (q - l)/q equality 
holds in Eq. (11), the average of the P E (uys with respect to 
one particular probability distribution is Q. Since P E (u) is 0 
for u < d - f, it follows that for some values of u,P E (u)> Q . 
Thus the conjecture that P E (u) < Q for all u isn’t tenable. 


II. Preliminaries 

In this section we will review some known results about 
MDS codes which are needed in our proof. Our remarks will 
be self-contained, but proofs may also be found in Ref. 4, 
Chapter 11 . 

Let C be a code, not necessarily linear, of length n with 
q k codewords over GF(q). If we examine any set of k - 1 
components of the codewords, we find that there are only 
q k ~ ] possibilities for the q k codewords. Thus there must be a 
pair of codewords which agree on these k - 1 components, and 
so the minimum distance d of the code must satisfy d < 
n - k + 1 A code for which d = n - k + 1 is called a maximum- 
distance separable (MDS) code. By this definition, Reed- 
Solomon codes and cosets of Reed-Solomon codes are MDS 
codes. 

Let K be a subset of k coordinate positions of an MDS 
code If two codewords were equal on K, the distance between 
them would be at most n - k. But this is impossible, since 
d = n - k + 1 We conclude that all q k codewords are different 
on K, and so, for any possible A;-tuple of elements from GF(q ), 
say a = (eq , a 2 , . . . , a k ) t there is a unique codeword, which, 
when restricted to K, equals a. This important fact we call the 
basic combinatorial property of MDS codes. 

We now wish to estimate the number of codewords of 
weight n, for u > d, in an MDS code. A word of weight u must 
vanish on a set of v = n - u coordinates. Thus let Fbe an arbi- 
trary subset of v coordinates. We will estimate the number of 
codewords that vanish on V. Since u>d, then v < k - 1 . Thus 
by the basic combinatorial property, if we specify that the 
codeword is zero exactly on V, we may specify (k - 1 >) other, 
nonzero, components arbitrarily. There are (q - \) k ~ v = 
(( q - l) u ~ r ways to do this, and so there are at most (q - l) w-r 
codewords that vanish exactly on V. Since there are (” ) = ( " ) 
possibilities for V, if A u denotes the number of codewords of 
weight m, we have: 


(<7- 1)w - r for u>d (12) 

Next we let V be a subset of v coordinate positions, where 
v > k. If we project the original code onto V, the result will be 
a certain (v,fc) code. Since the parent ( n,k ) code has d = 
n - k + 1 , the new code must have distance d' >d - (n - v) = 
v - k + 1. Since it is impossible for d f to be greater than 
v - k + 1 , equality must hold and it follows that the projected 
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code is a (v, k) MDS code. This simple fact will be referred to 
in the proof in the next section. 


that the number of codewords with weight won V is at most 
(^) (q - l) w ~ r> , where r = r - u is the redundancy of the 
restricted code. 


III. Proof of Results 

We call a word, not necessarily a codeword, decodable if it 
lies within distance t of some codeword. If D u denotes the 
number of decodable words of weight u , then for u > t + 1 , 
we have, assuming that all error patterns of weight u are 
equiprobable, 


For each codeword C with weight w in V , we must count 
the number of E’ s such that C + E vanishes on V. Suppose that 
E has weights s > w. On F, E must match C exactly, but the 
( s - w) other nonzero components can be arbitrarily placed 
outside F Thus the total number of £” s, for a given C of 
weight w, is t 


P E (U) = 



(13) 


Thus the problem of finding the P E (uys is essentially the same 
as that of finding the weight enumerator for the set of decod- 
able words. For example, Eq. (7c) is equivalent to 


D u <( n u )(q-l) u - r VJJ) ioxu>d (14) 


The plan is to obtain upper bounds on D u which will imply 
our various bounds on P E (u). We need to distinguish two 
cases, u > d and u < d - 1 . 

First we assume u > d. Each decodable word can be written 
uniquely as C + E, where C is a codeword and E is a word of 
weight < t. For a fixed E , as C runs through the set of code- 
words, {C + E} is a coset of the RS code. Since any coset of 
a RS code is an MDS code, by Eq (12) we know that the num- 
ber of words of weight u is less than or equal to (”)(<?- l) M “ r , 
since we are assuming u>d . Since the set of decodable words 
is the disjoint union of V n (t) cosets of the RS code, Eq (14) 
(and therefore Eq. [7c] ) follows. 

Now we assume that u < d - 1. A decodable word of 
weight u will vanish on a set of size v = n - u. For each of the 
(") subsets V of v coordinates, we will obtain an upper bound 
on the number of decodable words of weight u that vanish on 
V. This upper bound will imply Eq. (7b). 


S=W 


Therefore the total number of decodable words vanishing on V 
is at most 

w-d-u s- w 

■ E «•-«• E C) (.-») 

s-d~u w-d—u 

This is a bound on the number of decodable words of weight u 
vanishing on V. If we multiply it by the number of possible 
subsets V with v elements, viz. (") = (2) we obtain a bound on 
D u , and hence by Eq. (1 3), 

V-xta-.r E <«-»' E 0 

s-d-u w-d—u 

(15) 

This bound is a bit clumsy for everyday use, but we note in 
passing that for u = d - t (the smallest value of u for which 
P E {u) isn’t 0) it simplifies to 


^(d-0<(?-l)- (d - f - 1) ( M ‘^ + f ) (16) 


As before, we will use the fact that each decodable word is 
of the form C + £, where Cis a codeword and E has weight > t. 
If the sum C + E vanishes on V 9 then C must have weight < t 
on V, say weight w. We note that w = 0 isn’t possible, since 
u > t + 1. By our remarks in Section II, we know that C 
restricted to V is a linear (v, k) MDS code, and so its minimum 
weight (distance) is d - u. Thus w, the weight of C on F, 
satisfies d - u < vv < t. (If d - u > r, there are no such words, 
this gives another proof of Eq. [7a] .) By Eq. (12), it follows 


which is in fact the exact value of P E (u) in this case (Ref. 1). 

Finally, we simplify the bound (15) by recalling a well- 
known combinatorial identity (Ref. 3, Eq. [1 .2.6 21])* 


Ed(/J -CD 

w>0 
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Since v + u = n, this means that the inner sum in Eq (15) is at 
most ), and so Eq. (7b) follows from Eq. (15). 


IV. Numerical Results 

Using Eq. (8) we are able to compute an upper bound to 
the probability of undetected error, given that a word has 
more than t errors. In the case of the Voyager Reed-Solomon 
code, this gives an upper bound of 2.97 X 10~ 14 . Better 


bounds depend on knowing the error probability so that the 
expected number of errors can be taken into account For 
example, the probability of undetected error, given that a 
word has exactly 17 errors (the smallest number that the 
code is unable to decode correctly) is 1 09 X 10~ 14 (see 
Eq. [16]). This means that at a low error rate, when most 
words which fail to decode have exactly 17 errors, the prob- 
ability of undetected word error given that a word fails to 
decode correctly, can be as low as 1.1 X 10' 14 . In any case, 
we have very good confidence in those words which do 
decode. 
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Fig. 1. A digital data coding scheme 
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Appendix 


In this appendix we will derive several useful approxima- 
tions to the bound Q* that appears in Eq. (8). In fact all of 
our results will follow from bounds on the binomial sum 
V n {t) defined in Eq. (2). 

It follows from results in Ref. 5 (Appendix A, Eqs. [A-5} 
and [A-9])that 

<*■» 

provided the denominator within the braces is positive. (This 
will certainly be the case in our application, since 2t + 1 < 
n and q > 2.) Now if we assume that n < q (which holds for 
all RS codes and all but a few exotic MDS codes described in 


Chapter 11 of Ref. 4), the term within the braces will be 
< (n + l)/rc. Thus we have 

(A-2) 

Since (n + l)(n - 1) < n 1 , it follows from Eq. (A-2) that 

for f > 2 (A -3) 

If n < q - 1, the bound (A-3) immediately implies Eq. (9) 
for t > 2. The case t = 1 in Eq. (9) must be handled separately, 
and follows from the fact that V n (1 ) = 1 + n(q - 1 ) is less than 
or equal to 1 + (q - l) 2 , if n < q - 1 . 
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